
GCCAGGTCTGGCACCATGCACTAGGATACCCAGAACGCTGCAAGGCCACGCC 

CTCCTCACTTCAGGGGTCACTCTCCCCATTGCCCACCACCCCACCATGGCTGGG 

GATCGGCTCCCGAGGAAGGTGATGGACGCAAAGAAACTGGCCAGCCTGCTGC 

GTGGCGGGCCTGGGGGACCCTTGGTCATCGACAGCCGGTCCTTCGTGGAGTAT

AACAGCTGCCACGTGCTGAGCTCTGTGAATATCTGCTGTTCAAAGCTGGTGAA 

GCGGCGCCTTCAGCAGGGAAAAGTGACAATTGCTGAGCTTATCCAGCCTGCTA 

CACGGAGCCAGGTGGATGCCACAGAACCACAGGATGTAGTGGTGTATGACCA 

GAGCACACGAGATGCCAGCGTGCTGGCAGCAGACAGCTTCCTGTCCATCCTGC 

TCAGCAAGCTGGACGGCTGCTTCGACAGTGTGGCCATCCTCACAGGAGGCTTC 

GCCACCTTCTCCTCCTGCTTCCCTGGCCTCTGTGAGGGCAAGCCTGCCACTCTA 

CCGTCCATGAGCCTCTCTCAGCCCTGCCTGCCTGTGCCCAGTGTTGGCCTGACC 

CGAATCCTGCCTCACCTCTACCTGGGCTCTCAGAAAGATGTCTTGAACAAGGA 

TCTGATGACCCAAAACGGAATAAGCTATGTCCTCAATGCCAGCAACTCCTGCC 

CTAAACCGGACTTCATCTGTGAGAGCCGTTTCATGCGTATCCCCATCAATGAC 

AACTACTGTGAAAAGCTGCTGCCCTGGCTGGACAAGTCCATCGAGTTTATTGA 

TAAAGCCAAGCTGTCCAGCTGCCAAGTCATTGTTCACTGTCTGGCTGGCATCTC 

TCGCTCTGCCACCATTGCCATCGCGTACATCATGAAAACCATGGGCATGTCTTC 

TGACGACGCATACAGGTTTGTGAAGGATCGGCGCCCCTCCATCTCGCCCAACT 

TCAACTTCCTGGGCCAGTTGCTGGAGTATGAGAGGAGTCTGAAGCTGCTGGCT 

GCCCTGCAGACTGATGGACCTCACTTGGGGACCCCTGAGCCCCTCATGGGCCC 

GGCAGCAGGCATCCCACTGCCCCGGCTGCCACCATCTACCTCAGAGAGCGCTG 

CCACTGGGAGCGAGGCAGCCACCGCAGCCAGGGAGGGCAGCCCAAGTGCTGG 

AGGGGATGCTCCGATCCCCAGCACAGCTCCAGCCACCAGCGCGCTGCAGCAG 

GGCCTGCGTGGCCTGCACCTCTCCTCTGACCGCCTCCAGGACACCAACCGCCT 

CAAGCGTTCCTTTTCCCTGGACATCAAGTCGGCCTATGCACCCAGCAGGAGGC 

CCGACTTTCCCGGCCCACCCGACCCCGGTGAAGCCCCGAAGCTCTGCAAGCTG 

GACAGCCCGTCTGGGGGCACACTGGGCCTGCCCTCGCCCAGCCCAGACAGCCC 

GGACTCCGTTCCAGAGTGCCGCCCACGACCCCGCCGGCGACGCCCCCCGGCTA 

GTTCGCCTGCCCGCTCCCCCGCGCATGGTCTGGGCCTGAACTTTGGAGACACG 

GCCCGGCAGACTCCACGGCACGGCCTCTCGGCCCTGTCGGCGCCCGGGCTGCC 

TGGCCCTGGCCAGCCGGCTGGCCCCGGGGGCTGGGTGCCGCCACTGGACTCCC 

CAGGCACACCGTCGCCCGACGGCCCCTGGTGCTTCAGCCCCGAGGGCGCGCA 

GGGTCCAGGCGCTGTGTTCTCCGCCTTTGGCCGGGTAAGTGCAGGCGCACCTG 

GACCCGGTAACAGCAGCAGCAGCGGTGGTGGTGGTGGTGGTGGTGGCGGCGG 

CGGCGGCGGCGGCGGCGGCGGCGGCAGCAGCAGCAGCAACAGCAGCAGCAG 

CAGCAGCAGCAGCAGCAGCAGCAGCAGCAGTAGTAGTAGTAGTAGTGACCTG 

CGGAGGCGGGATGTGCGGACCGGCTGGCCCGAGGAGCCTGCTGCAGATGCAC 

AGTTCAAGAGGCGCAGCTGCCAGATGGAGTTCGAAGAGGGCATGGTGGAGGG 

GCGGGCACGTGGCGAGGAGCTGGCAGCCCTGGGCAAGCAAACCAGCTTCTCT 

GGCAGCGTGGAGGTCATCGAAGTATCGTGACCCTTCAGAAGTCCCTGTGCCCT 

TGCTCCAGCCAGGCCAGGTATAAATATATATTATATATAAAACACACAGAAAA 

GGTAAATGGTTTTACTGCAATTTTTATCAAGAAGTAAATATTTCGATTTTTTAT 

TTATTTAAGCTAGTGATCTGGCAACTGTGCGGGGCGGCCCTAAAGCTCTGTTTT 

TACTGTCTGGTATTTAAACTGAAACAGGTTTCTAAGCAATATGAGGCCACCTT 

CAATCCCAAACTGGGTTGACAGGCCTGGGCCCCTCCTTGCCCCTCCCCTCTGG 

AAACATTACTGACCTTTCAAAGAGCTGCCCAGCTTTCCTGCACTTTTTACATAA 

GAAAAAAGGGGGGGGGGGAA (SEQ ID NO: 1)
underlined = deleted in targeting construct

[ ] = sequence flanking Neo insert in targeting construct 



[GCCAGGTCTGGCACCATGCACTAGGATACCCAGAACGCTGCAAGGCCACGCCCTCCTCAC 
TTCAGGGGTCACTCTCCCCATTGCCCACCACCCCACCATGGCTGGGGATCGGCTCCCGAG 
GAAGGTGATGGACGCAAAGAAA]CTGGCCAGCCTGCTGCGTGGCGGGCCTGGGGGACCCTT
GGTCATCGACAGCCGGTCCTTCGTGGAGTATAACAGCTGCCACGTGCTGAGCTCTGTGAA
TATCTGCTGTTCAAAGCTGGTGAAGCGGCGCCTTCAGCAGGGAAAAGTGACAATTGCTGA
GCTT[ATCCAGCCTGCTACACGGAGCCAG]GTGGATGCCACAGAACCACAGGATGTAGTGGT
GTATGACCAGAGCACACGAGATGCCAGCGTGCTGGCAGCAGACAGCTTCCTGTCCATCCT 
GCTCAGCAAGCTGGACGGCTGCTTCGACAGTGTGGCCATCCTCACAGGAGGCTTCGCCAC 
CTTCTCCTCCTGCTTCCCTGGCCTCTGTGAGGGCAAGCCTGCCACTCTACCGTCCATGAG 
CCTCTCTCAGCCCTGCCTGCCTGTGCCCAGTGTTGGCCTGACCCGAATCCTGCCTCACCT 
CTACCTGGGCTCTCAGAAAGATGTCTTGAACAAGGATCTGATGACCCAAAACGGAATAAG 
CTATGTCCTCAATGCCAGCAACTCCTGCCCTAAACCGGACTTCATCTGTGAGAGCCGTTT 
CATGCGTATCCCCATCAATGACAACTACTGTGAAAAGCTGCTGCCCTGGCTGGACAAGTC 
CATCGAGTTTATTGATAAAGCCAAGCTGTCCAGCTGCCAAGTCATTGTTCACTGTCTGGC 
TGGCATCTCTCGCTCTGCCACCATTGCCATCGCGTACATCATGAAAACCATGGGCATGTC 
TTCTGACGACGCATACAGGTTTGTGAAGGATCGGCGCCCCTCCATCTCGCCCAACTTCAA 
CTTCCTGGGCCAGTTGCTGGAGTATGAGAGGAGTCTGAAGCTGCTGGCTGCCCTGCAGAC 
TGATGGACCTCACTTGGGGACCCCTGAGCCCCTCATGGGCCCGGCAGCAGGCATCCCACT 
GCCCCGGCTGCCACCATCTACCTCAGAGAGCGCTGCCACTGGGAGCGAGGCAGCCACCGC 
AGCCAGGGAGGGCAGCCCAAGTGCTGGAGGGGATGCTCCGATCCCCAGCACAGCTCCAGC 
CACCAGCGCGCTGCAGCAGGGCCTGCGTGGCCTGCACCTCTCCTCTGACCGCCTCCAGGA 
CACCAACCGCCTCAAGCGTTCCTTTTCCCTGGACATCAAGTCGGCCTATGCACCCAGCAG 
GAGGCCCGACTTTCCCGGCCCACCCGACCCCGGTGAAGCCCCGAAGCTCTGCAAGCTGGA 
CAGCCCGTCTGGGGGCACACTGGGCCTGCCCTCGCCCAGCCCAGACAGCCCGGACTCCGT 
TCCAGAGTGCCGCCCACGACCCCGCCGGCGACGCCCCCCGGCTAGTTCGCCTGCCCGCTC 
CCCCGCGCATGGTCTGGGCCTGAACTTTGGAGACACGGCCCGGCAGACTCCACGGCACGG 
CCTCTCGGCCCTGTCGGCGCCCGGGCTGCCTGGCCCTGGCCAGCCGGCTGGCCCCGGGGG 
CTGGGTGCCGCCACTGGACTCCCCAGGCACACCGTCGCCCGACGGCCCCTGGTGCTTCAG 
CCCCGAGGGCGCGCAGGGTCCAGGCGCTGTGTTCTCCGCCTTTGGCCGGGTAAGTGCAGG 
CGCACCTGGACCCGGTAACAGCAGCAGCAGCGGTGGTGGTGGTGGTGGTGGTGGCGGCGG 
CGGCGGCGGCGGCGGCGGCGGCGGCAGCAGCAGCAGCAACAGCAGCAGCAGCAGCAGCAG 
CAGCAGCAGCAGCAGCAGCAGTAGTAGTAGTAGTAGTGACCTGCGGAGGCGGGATGTGCG 
GACCGGCTGGCCCGAGGAGCCTGCTGCAGATGCACAGTTCAAGAGGCGCAGCTGCCAGAT 
GGAGTTCGAAGAGGGCATGGTGGAGGGGCGGGCACGTGGCGAGGAGCTGGCAGCCCTGGG 
CAAGCAAACCAGCTTCTCTGGCAGCGTGGAGGTCATCGAAGTATCGTGACCCTTCAGAAG 
TCCCTGTGCCCTTGCTCCAGCCAGGCCAGGTATAAATATATATTATATATAAAACACACA 
GAAAAGGTAAATGGTTTTACTGCAATTTTTATCAAGAAGTAAATATTTCGATTTTTTATT 
TATTTAAGCTAGTGATCTGGCAACTGTGCGGGGCGGCCCTAAAGCTCTGTTTTTACTGTC 
TGGTATTTAAACTGAAACAGGTTTCTAAGCAATATGAGGCCACCTTCAATCCCAAACTGG 
GTTGACAGGCCTGGGCCCCTCCTTGCCCCTCCCCTCTGGAAACATTACTGACCTTTCAAA 
GAGCTGCCCAGCTTTCCTGCACTTTTTACATAAGAAAAAAGGGGGGGGGGGAA 
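
The bracket notation in the legend can be read programmatically. The following is a minimal sketch, assuming the annotated sequence is available as plain text with optional whitespace and line breaks; the function name `bracketed_segments` is hypothetical and introduced only for illustration.

```python
import re


def bracketed_segments(annotated: str) -> list:
    """Return the bases enclosed by each [ ... ] pair, in order.

    Whitespace and line breaks are removed first, so a bracketed
    segment may span several lines of the listing.
    """
    compact = re.sub(r"\s+", "", annotated)
    return re.findall(r"\[([ACGT]+)\]", compact)


if __name__ == "__main__":
    # Short excerpt taken from the annotated listing above.
    excerpt = "GCTT [ ATCCAGCCTGCTACACGGAGCCAG] GTGGATGCC"
    for segment in bracketed_segments(excerpt):
        print(len(segment), segment)
```

Applied to the full annotated listing, the function would return the two bracketed segments that the legend describes as flanking the Neo insert.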
Gene Sequence Structure* (diagram)
Diagram labels: 143 bp; Sequence Deleted; 304 bp
Size of full-length cDNA: 2453 bp

Targeting Vector* (genomic sequence)
Arm Length: 5': 2.5 kb; 3': 2 kb
Diagram labels: 5' arm; LacZ-Neo Cassette; 3' arm; 3' probe
Diagram rows: Targeting Vector; Endogenous Locus

* Not drawn to scale



5'>TCCTGGGAGCCAGCTATAGCT
ACCCAGATCCCACCATCTGCTGAC 
TATTCACCTTTCCCCCAGGTCTGG 
CACCATGCACTAGGATACCCAGAA 
CGCTGCAAGGCCACGCCCTCCTCA 
CTTCAGGGGTCACTCTCCCCATTG 
CCCACCACCCCACCATGGCTGGGG 
ATCGGCTCCCGAGGAAGGTGATGG 
ACGCAAAGAAA<3'
(SEQ ID NO: 2) 



5'>ATCCAGCCTGCTACACGAAGC
CAGGTACCTGTGGCCCACCCTTGC 
ATGCGTCTTCAGGGCTGACCATTC 
CTGAGCAAACAGACCTATGTCACC
TCTGAAAGAGACAGAGGAGCTCCC 
AGGCCTGGTGCCAAGAGTCCTCTG 
ATAAGGCATTTCCCCCTCGCTGTC 
CCTCCGTTCCAAACAGGGTTCCTT 
GGGGTCAGAGC<3'
(SEQ ID NO: 3) 